File Version Based Continuous Data Protection on Distributed Object Storage

نویسندگان

  • Xin Yang
  • Ning Jing
  • Jiangjiang Wu
  • Jun Li
چکیده

Continuous Data Protection (CDP) can restore data to any point-in-time, but high storage overhead and drastic system performance drop restricts its application. In this paper, we propose a file version based file level CDP system (FV-CDP) by using cheap distributed storage for backup to low down the storage costs and using local object cache and parralel asynchronous object sending to mask network storage latency. It designs special opration log to identify the file system hierarchy at any point-in-time and exploits parallel restoring in filesystem recovery. The experimental results show that parallel asynchronous objects sending makes the FV-CDP system max write ops to get improved by about 3.4 times, and the parallel recovery reduces file system recovery time by up to 57%. Under high frequency file syetem change workload, FVCDP causes a large storage space overhead.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Data Grids Performance by Using Modified Dynamic Hierarchical Replication Strategy

Abstract: A Data Grid connects a collection of geographically distributed computational and storage resources that enables users to share data and other resources. Data replication, a technique much discussed by Data Grid researchers in recent years creates multiple copies of file and places them in various locations to shorten file access times. In this paper, a dynamic data replication strate...

متن کامل

Intrinsic References in Distributed Systems

distributed, storage, hash function The notion of intrinsic references, i.e. references based on the hash digest of the referent, is introduced and contrasted with that of physical references, where the referent is defined relative to the state of a physical system. A retrieval mechanism using intrinsic references, the Elephant Store, is presented. The use of intrinsic references in hierarchica...

متن کامل

An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity

The data grid technology, which uses the scale of the Internet to solve storage limitation for the huge amount of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environment to copy frequently accessed data in suitable sites. The primary purposes are shortening distance of file transmission and achieving files from ...

متن کامل

Snapshots in large-scale distributed file systems

Snapshots are present in many modern file systems, where they allow to create consistent on-line backups, to roll back corruptions or inadvertent changes of files, and to keep a record of changes to files and directories. While most previous work on file system snapshots refers to local file systems, modern trends like cloud and cluster computing have shifted the focus towards distributed stora...

متن کامل

Secure and Fault Tolerant Distributed Framework with Mobility Support

In this paper, we propose an architecture of distributed data storage framework that incorporates fault tolerance, mobility support, and security. Main goal of our system is to provide equal opportunities for both connected and disconnected clients. Consequence is that mutual exclusion may not be involved. Data storage systems without mutual exclusion suffer from update and name conflicts. We a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017